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We present a search for associated production of the standard model (SM) Higgs boson and a 
Z boson where the Z boson decays to two leptons and the Higgs decays to a pair of b quarks in 
pp collisions at the Fermilab Tevatron. We use event probabilities based on SM matrix elements 
to construct a likelihood function of the Higgs content of the data sample. In a CDF data sample 
corresponding to an integrated luminosity of 2.7 fb _1 we see no evidence of a Higgs boson with a 
mass between 100 GeV/c 2 and 150 GeV/c 2 . We set 95% confidence level (C.L.) upper limits on the 
cross-section for ZH production as a function of the Higgs boson mass mn\ the limit is 8.2 times 
the SM prediction at win = 115 GeV/c 2 . 
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In the standard model (SM), the Higgs mechanism is 
responsible for the observed breaking of the SU(2) l <& 
U(l) symmetry [IJ [2], yet the Higgs boson remains the 
only SM particle that has not been directly observed. 
Direct searches have set a lower limit on the SM Higgs 
boson mass m H of 114.4 GeV/c 2 at 95% C.L. |3J, while 
precision electroweak measurements indirectly constrain 
its mass to ran = 76^4 GeV/c 2 [4]. At hadron colliders 
the dominant production process for the SM Higgs boson 
is gg — * H while its decays are dominated by H — ► bb for 
mji < 140 GeV/c 2 . However, the process gg — ► H — > bb 
is dwarfed by multi-jet background, necessitating the 
search for Higgs bosons produced in association with a W 
or Z boson that decays leptonically. This article reports 
a search for the process pp — > ZH — > £~£ + bb (£ = e, /x) in 
data with an integrated luminosity of 2.7 fb _1 collected 
with the CDF II detector, nearly 3 times that of the pre- 
viously reported analysis [5] . The study of Higgs boson 
production in association with a W/Z gauge boson for 
low Higgs boson masses is further motivated by the fact 
that the signal to background ratio is more favorable at 
the Tevatron compared to the Large Hadron Collider. 

For the first time in a ZH — > £~£ + bb search, we uti- 
lize a method based on leading-order matrix element 
calculations [SJ [5] convoluted with detector resolu- 
tion functions [9] that form per-event likelihoods. This 
method, pioneered for use in top quark mass measure- 
ments [TUJ [TT], has been recently used in Higgs boson 
searches in other decay channels [T^] by forming a dis- 
criminating per-event variable. We extend the technique 
by expressing the event likelihoods as a function of the 
ZH signal fraction and maximizing the joint likelihood 
for the data sample with respect to the signal fraction. 
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The CDF II detector [U D3] is an azimuthally 
and forward-backward symmetric apparatus designed to 
study pp collisions at the Fermilab Tevatron. It con- 
sists of a magnetic spectrometer surrounded by calorime- 
ters and muon chambers. The charged particle tracking 
system, consisting of a silicon detector and drift cham- 
ber, is immersed in a 1.4 T magnetic field parallel to 
the p and p beams. Calorimeters segmented in 77 and 
4> surround the tracking system and measure the energy 
of particles detected within them. The electromagnetic 
and hadronic calorimeters are lead-scintillator and iron- 
scintillator sampling devices, respectively. Drift cham- 
bers located outside the central hadron calorimeters de- 
tect muons. The data used in this analysis are col- 
lected with an online selection that requires events to 
have a lepton with Et > 18 GeV (for an electron) or 
p T > 18 GeV/c (for a muon) [T4"] . 

The event selection used in this analysis closely follows 
that in Ref. [5J. Candidate events are required to have 
a pair of oppositely charged electrons or muons with in- 
variant mass 76 < mu < 106 GeV/c 2 . Candidate events 
are also required to have one jet with Et > 25 GeV 
and at least one additional jet with Et > 15 GeV, both 
within 1 77 1 < 2.0. All jet energies are corrected for non- 
uniformities in calorimeter response, effects from multiple 
pp interactions and for the hadronic energy scale of the 
calorimeter |15j . Candidate events are required to have 
at least one jet with an associated displaced secondary 
vertex [16] ( "6-tags" , reconstructed using tracks with hits 
in the silicon detector), thus enriching the 6-quark con- 
tent of the sample. 

The backgrounds for this analysis are dominated by 
events with real Z bosons with additional contributions 
from ti and events where an object, such as a jet, is 
mis-identified as a lepton. We model the backgrounds 
with events generated with leading-order event genera- 
tors, normalized to next-to-leading order cross-sections 
and simulated with a GEANT-based description of the 
CDF II detector [9J . Z+light-flavor jet contributions are 
modeled with the ALPGEN [T7] simulation code matched 
with PYTHIA in the MLM scheme [T7] for the hadroniza- 
tion and fragmentation. Heavy flavor contributions from 
Z + bb and Z + cc are modeled separately with ALP- 
GEN and combined with the light-flavor jet samples. The 
WZ, ZZ and ti processes are modeled using PYTHIA [Tg] . 
Events where a jet is mis-identified as a charged lep- 
ton are modeled using jet-enriched data samples [51 119j. 
We model the kinematics of ZH — ► £ + £~bb events using 
PYTHIA for m H ranging from 100 GeV/c 2 to 150 GeV/c 2 . 
The signal and background contributions expected in 
2.7 fb _1 and the number of observed events are given 
in Table |H 

We denote the ZH signal probability by Pz//(xi|m_f/) 
where m# is a parameter and Xi represents the collection 
of the measured 4-vector momenta of the two selected 
leptons, the two selected jets, and the two components 
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TABLE I: Expected and observed numbers of events with 
1 or 2 6-tagged jets in 2.7 fb _1 of data. The ZH expecta- 
tion is shown for win = 115 GeV/c 2 assuming the produc- 
tion cross section at <Js = 1.96 TeV for qq — > Z* — > ZH 
to be 1.04 pb [2U] and the branching ratio B(H — > 66) to be 
73% El- 



Source 


1 tag 


> 2 tag 


Z -> £ + 7~±light partons 


129.6 ±24.0 


5.5 ±0.9 


Z -> £+£- + bb, cc 


107.2 ± 14.0 


19.5 ±3.4 


zz, wz 


11.6 ± 1.3 


2.9 ± 0.4 


ti 


13.9 ±2.0 


7.7 ± 1.1 


Mis-ID lepton 


15.9 ±6.5 


0.4 ± 0.2 


ZH 


1.3 ±0.2 


0.7 ±0.1 


Total expected 


279.5 ±28.6 


36.3 ±3.7 


Data 


258 


32 





0.5 1 1.5 

tan-ftP -PJ/L] 



of the missing transverse momentum, in a given event 
i. Similarly we denote the background probability as 
Pfc(xi). The per-event likelihood as a function of the 
signal fraction s for a given event i is 



FIG. 1: The distribution of tan" 1 T> where the discriminant 
V = (Pzh — Pb)/L for expected backgrounds and data for 
events with one (left) and two (right) 6-tags. The expected 
signal ( x 10) is overlaid. 



L(s,Xi\m H ) = sP ZH (xi\m H ) ± (1 - s)A(xi). (1) 

We evaluate Pzh an d Pb by convoluting the leading- 
order matrix elements for the process with detector reso- 
lution functions and integrating over unmeasured quan- 
tities. Thus, Pzh is a probability density in Xi and can 
be expressed as 

P Z H(xi\m H ) = 1 / d^>\MzH(q,p;m H )\ 2 
crlrriHl I 

1 ; (2) 
x Y[[W(p j ,x i )}fp DF (q 1 )f PDF (q 2 ) 

3 

where M.zh is the leading-order matrix element for the 
process qq — > ZH — > £ + £~bb evaluated for a pair of in- 
coming partons q and outgoing particles p, W(pj, Xi) are 
transfer functions [22] linking the outgoing particle mo- 
menta pj to measured quantities Xj and the fpDF arc 
parton density functions of the incoming partons. The 
factor l/a^mn) ensures that the probability density sat- 
isfies the normalization condition, f dx^ Pzff(xi|m#) = 
1. 

The sample likelihood C is obtained by taking the 
product over all events i in the sample 

£(s\m H ) = Y]_L(s,Xi\m H ). (3) 

i 

We enhance our statistical sensitivity by exploiting the 
expected difference in the rate of signal and background 
events with two 6-tagged jets. We replace PzH^^mn) 
by P Z H(x-i,n\m H ) = PzH(xi\m H ) ■ PzH(n\m H ) and 
Pfc(xi) by P b (x ; ,n) = P b (x ; ) ■ P b (n), where PzH(n\m H ) 



TABLE II: Expected single- and double-tag probabilities, 
P(n — 1) and P(n > 2), for signal and background events 
passing our selection. Z — > £ + £~ +]ets includes jets from both 
light and heavy quarks. 



Source P(n = 1) P(n > 2) 

Z -> £+^"±jets 091 O09 

WZ,ZZ 0.80 0.20 

tt 0.74 0.26 

ZH (m H = 100 GeV/c 2 ) 0.67 0.33 

ZH (m H = 125 GeV/c 2 ) 0.65 0.35 

ZH (m H = 150 GeV/c 2 ) 0.63 0.37 



[Pb(n)] denotes the probability of tagging signal [back- 
ground] events with n tags. Table [lT| shows the expected 
tagging rates for simulated signal and background event 
samples. 

The measured signal fraction S meas is the value of s 
which maximizes £(s|m#). Using Eq. Q, we can de- 
fine a per-cvent discriminant T>i= dhiL/ds = (Pzh — 
Pb)/L which increases (decreases) for more signal- like 
(background-like) events. The maximum- likelihood esti- 
mator for the measured signal fraction S mea s corresponds 
to T,iT>i\ s= s mcas — 0. The distribution of tan _1 2?;(s = 
Smeas) for simulated events and data is shown in Fig. [T| 

The dominant backgrounds in our data sample are due 
to Z±jets, ti and ZZ processes, in the expected propor- 
tions denoted by Xzjji \t an d ^zz respectively The 
background probability in Eq. ([IJ is given by 

P fc (xi,n) = \zjjPzjj(xi,n)+\ t iPti{xun)+\zzPzz{xi,n), 

(4) 

where Pzjj(xi, n), P t f(xi,n) and Pzz(xi,ri) are the re- 
spective probability densities (normalized to unit inte- 
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gral) for the Z+jets, tt, and ZZ background processes 
with n tags. Normalization of Pb is ensured by requiring 
^Zjj + \i + ^zz = 1. 

We construct confidence intervals 23J for the test 
statistic R = C(S meas \S t rue) / ' C(S meas \S^ e ) by perform- 
ing simulated experiments with the expected proportions 
of background and varying the amounts of signal, such 
that Stme is the true (input) signal fraction in the simu- 
lated experiment. S^ul ^ s the input signal fraction that 
has the highest likelihood for a given measured signal 
fraction, S raeas . £( K S meas \Strue) is given by Eq. Q for 
the simulated experiment with the chosen value of Strue 
and run- Since we are measuring the fractional signal 
content in the data sample, the number of events in each 
simulated experiment is held fixed at the value of 290 
events observed in the data. 

The methodology from Ref. (23] is used to construct 
confidence intervals in S meas for each chosen value of 
Strue and 77i H . This method removes any bias resulting 
from imperfections in our modeling by relating S meas to 
Strue- The confidence intervals in S meas obtained for 
tuh = 115 GeV/c 2 and < Strue < 0.25 are shown in 
Fig. [2j For a given value of S meas obtained from the 
data (or from an independent simulated experiment to 
evaluate the a priori expectation), we extract the range 
of Strue for which the confidence intervals contain this 
value of S meas . A feature of this method is that the 
resulting range of St rue can be quoted as an upper limit 
on Strue (if the lower bound is zero) or as a two-sided 
measurement of Strue- As Fig. [2] shows, we obtain an 
upper limit on Strue given the data, which we convert 
to the equivalent upper limit on the signal cross section. 
This procedure is repeated for the range of Higgs boson 
masses 100 < m H < 150 GeV/c 2 . 

We evaluate systematic uncertainties by varying pro- 
cess rates and kinematic distributions in our simulated 
experiments. We apply a rate uncertainty of 40% for Z 
boson events and of 20% for diboson and tt events. The 
uncertainty on the rate of heavy flavor production in as- 
sociation with a gauge boson is based on comparisons of 
data with theoretical predictions [H]. The uncertainty 
on the diboson and tt contribution includes the uncer- 
tainties in the cross sections, selection efficiencies and 
the top quark mass [5]. A rate uncertainty of 50% is 
applied for mis-identified lepton events due to the un- 
certainty on the lepton misidentification probability |5J. 
A rate uncertainty of 6% due to the luminosity uncer- 
tainty is applied to all events. The per-jet uncertainty 
on the ^-tagging efficiency is 8% for events with b par- 
tons, 16% for events with c partons and 13% for events 
with no heavy flavor [5]. Our analysis is weakly sensitive 
to uncertainties in the expected total number of events 
passing our selection, since it relies only on the shapes 
of measured distributions. Uncertainties in the shapes 
of kinematic distributions are propagated by varying the 
amount of QCD radiation in simulated signal events and 
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FIG. 2: Confidence intervals in the measured signal fraction 
S m eas (along the z-axis) with 68% C.L., 95% C.L. and 99.7% 
C.L., for a range of true signal fraction Strue values (along the 
y — axis on the left) chosen in the simulated experiments. The 
signal cross section ratio equivalent to Strue is shown on the 
y— axis on the right. The intervals shown here are computed 
for a Higgs boson mass of mu = 115 GeV/c 2 , and include 
statistical and systematic uncertainties. The vertical dashed 
line indicates the value of S mens obtained from the data. 



TABLE III: Upper limits at 95% C.L. on the ZH -» l+Tbb 
cross section, shown as a ratio to the SM cross section. The 
column labelled "Expected" shows the median of the limits 
obtained from simulated experiments containing no signal, 
and the columns labelled "±lrr" show the range containing 
68% of the expected limits. 



m H [GeV/c 2 ] 


-la 


Expected 


+lff 


Observed 




[ff/ffSAi] 


[u/osm] 


[ff/ffSM] 


[ff/ffSM] 


100 


6.0 


8.7 


12.4 


7.0 


105 


6.0 


8.7 


12.9 


6.5 


110 


7.5 


11.3 


16.8 


7.6 


115 


8.3 


12.1 


18.2 


8.2 


120 


9.3 


13.5 


20.0 


9.0 


125 


13.2 


18.3 


27.1 


13.2 


130 


17.1 


24.2 


35.7 


17.7 


135 


21.8 


31.0 


44.8 


22.9 


140 


31.0 


44.3 


65.4 


32.0 


145 


42.8 


61.6 


89.9 


43.1 


150 


73.7 


104 


153 


71.3 



the jet energy scale in simulated signal and background 
events within their respective uncertainties |15j . 

We evaluate confidence intervals for a range of Higgs 
masses between 100 GeV/c 2 and 150 GeV/c 2 . We eval- 
uate a priori 95% C.L. upper limits on the cross section 
for the process pp — ► ZH — ► £ + £~bb. We express these 
limits as a ratio with respect to the SM prediction. These 
expected limits along with those observed in the data are 
shown in Table ITTT1 
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In conclusion, we have performed a search for the SM 
Higgs boson decaying to bb produced in association with 
a Z boson. This is the first analysis performed in this 
channel with a matrix element method. The data show 
no excess over expected non-Higgs backgrounds. We set 
95% C.L. upper limits on the cross section of this pro- 
cess for a range of Higgs boson masses. The limit at 
ma = 115 GeV/c 2 is 8.2 times greater than the SM pre- 
diction. This result improves by a factor of 2 over the 
previously published result in this channel [5]. We are 
exploring further improvements in this technique by sep- 
arating the leading-order and next-to-leading order con- 
tributions to the signal and backgrounds, as well as the 
use of matrix-element-based probabilities in conjunction 
with other multivariate discriminants. 
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